A Generative Model of Phonotactics

Authors

  • Richard Futrell
  • Adam Albright
  • Peter Graff
  • Timothy J. O'Donnell
Abstract

We present a probabilistic model of phonotactics, the set of well-formed phoneme sequences in a language. Unlike most computational models of phonotactics (Hayes and Wilson, 2008; Goldsmith and Riggle, 2012), we take a fully generative approach, modeling a process where forms are built up out of subparts by phonologically-informed structure building operations. We learn an inventory of subparts by applying stochastic memoization (Johnson et al., 2007; Goodman et al., 2008) to a generative process for phonemes structured as an and-or graph, based on concepts of feature hierarchy from generative phonology (Clements, 1985; Dresher, 2009). Subparts are combined in a way that allows tier-based feature interactions. We evaluate our models’ ability to capture phonotactic distributions in the lexicons of 14 languages drawn from the WOLEX corpus (Graff, 2012). Our full model robustly assigns higher probabilities to held-out forms than a sophisticated N-gram model for all languages. We also present novel analyses that probe model behavior in more detail.
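To make the idea of stochastic memoization concrete, here is a minimal sketch of its simplest form, a Chinese-restaurant-process memoizer: each call either reuses a previously generated value (with probability proportional to how often it was returned before) or draws a fresh value from a base sampler (with probability proportional to a concentration parameter). This is not the authors' model. The class name `StochasticMemoizer`, the parameter `alpha`, and the toy base sampler `toy_base` are assumptions made for illustration only; the paper's generative process over feature hierarchies is not reproduced here.

```python
import random
from collections import Counter


class StochasticMemoizer:
    """Hypothetical CRP-style memoizer (illustrative, not the paper's code).

    With probability total / (total + alpha) a stored value is reused,
    chosen in proportion to its count; otherwise the base sampler is called.
    """

    def __init__(self, base_sampler, alpha, rng=None):
        self.base_sampler = base_sampler  # callable taking a random.Random
        self.alpha = alpha                # concentration parameter, >= 0
        self.rng = rng or random.Random()
        self.counts = Counter()           # value -> times returned so far
        self.total = 0                    # total number of draws so far

    def sample(self):
        r = self.rng.random() * (self.total + self.alpha)
        if r < self.total:
            # Reuse a stored value, proportional to its count.
            acc = 0
            for value, count in self.counts.items():
                acc += count
                if r < acc:
                    chosen = value
                    break
        else:
            # Generate a fresh value from the base process.
            chosen = self.base_sampler(self.rng)
        self.counts[chosen] += 1
        self.total += 1
        return chosen


def toy_base(rng):
    """Toy base process (an assumption): one consonant plus one vowel."""
    return rng.choice(["p", "t", "k", "s"]) + rng.choice(["a", "i", "u"])


memo = StochasticMemoizer(toy_base, alpha=1.0, rng=random.Random(0))
draws = [memo.sample() for _ in range(200)]

# With alpha = 0 only the very first call reaches the base sampler.
frozen = StochasticMemoizer(toy_base, alpha=0.0, rng=random.Random(1))
frozen_draws = [frozen.sample() for _ in range(50)]
```

Frequently reused values accumulate counts and become more likely to be reused again, which is the mechanism that lets a model of this kind store commonly recurring subparts as units.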

Similar resources

The Error-driven Ranking Model of the Acquisition of Phonotactics: How to Keep the Faithfulness Constraints at Bay

A problem which arises in the theory of the error-driven ranking model of the acquisition of phonotactics is that the faithfulness constraints need to be promoted but should not be promoted too high. This paper motivates this technical problem and shows how to tune the promotion component of the re-ranking rule so as to keep the faithfulness constraints at bay. Sections 1-2 introduce the algori...

The Error-driven Ranking Model of the Early Stage of the Acquisition of Phonotactics: an Initial Result on Restrictiveness

Nine-month-old infants are already sensitive to the distinction between licit and illicit forms (Jusczyk et al. 1993). They thus display knowledge of the target adult phonotactics at an early stage when morphology is plausibly still lagging behind (Hayes 2004) and the acquisition of the native language lexicon has barely begun (Fenson et al. 1994). How can this early stage of the acquisition of...

Voice-based Age and Gender Recognition using Training Generative Sparse Model

Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...

The Effect of Instruction Based on the Generative Learning Design Model on Nursing Students' Learning in a Physiology Course

Introduction: Traditional educational methods do not meet today's educational needs; modern educational systems use new teaching methods that enrich the teaching-learning process. The purpose of this study was to evaluate the effect of instruction based on the generative learning design model on nursing students' learning of physiology. Methods: In this study, the pr...

Improvement of generative adversarial networks for automatic text-to-image generation

This research concerns the use of deep learning tools and image processing technology for the automatic generation of images from text. Previous research has used a single sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions, each given as a sentence, to produce and improve the image. The proposed ...


Journal:
  • TACL

Volume 5  Issue 

Pages  -

Publication date 2017